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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Fred Hutchinson Cancer Research Center, Inc. 

(B) STREET: 1100 Fairview Avenue North, Mailstop C2M-027 

(C) CITY: Seattle 

(0) STATE: Washington 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 98109 

(A) NAME: Thomas Spies 

(B) STREET: 2429 E. Aloha 

(C) CITY: Seattle " " " 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 98112 

(A) NAME: Veronika Spies 

(B) STREET: 2429 E. Aloha 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 98112 

<ii) TITLE OF INVENTION: CELL STRESS REGULATED HUMAN MHC CLASS I GENE 
(iii) NUMBER OF SEQUENCES: 16 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM : PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPQ) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/029,044 

(B) FILING DATE: 29-OCT-1996 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CACTGCTTGA GCCGCTGAGA GGGTGGCGAC GTCGGGGCCA TGGGGCTGGG CCCGGTCTTC 
CTGCTTCTGG CTGGCATCTT CCCTTTTGCA CCTCCGGGAG CTGCTGCTGG TGAGTGGCGT 
TCCTGGCGGT CCTCGGCGGA GCGGGAGCAG TGGGACGTTT CCGGGGGTCG GGTGGGTAGC 
GGCGAGCGCT GTGCGGTCAG GGCGGGGCTC CTGTGCCCTG TCGGTGGCGC AGGGAGCTGG 
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ACGCGGCCCG TTACCGCCAC ACTTCAGQCC TGCTTCCCCG TCACTTTTCA GTCCTCCTCG 300 

GGATCGCGCA TCACCTGCAC TTTCTGGTCT CCTCCTGCTC TTTCTCTCCT CGCGTCTCCT 360 

CCGCTTCCTC TCACTTTTCG GACAAACCAG TCCTTCTGAG GCCCATGGGT TCCCGGGCTG 420 

CCTCCGGGGC TGCTCCTGTG AATGGCATTC GAGTGCCCTT CCAGCGCGGC CACTGAAGCA 480 

GCCACAACCC CCGGTGCTCG GGGCGGCTCT CAGGTCCCTG AAGTCCTGTC CTCTCCCGGA 540 

GCCGACGTGT TCTCAGCTCC TGGGCCGCAG CTCCTGGAGT AGGGGCCCTC CTTTCTCGGG 600 

ACCCGGAGCT GGTGCTTCCT GCTGCTGTGG GGACTGTGGG GGGTCCTGAC TCTCAAGCTG . . . £6j0 

AGGGGTTGGA GTCTGCAGGC TCCGGGCAGA GGATTCTTCC TGCGACTTCT CTCATCCCCA 720 

GCTCATTCTC CCCTCGCCTC TGGCTCCGAG GGTCCTCTCC TCTCTCTCAT CCCACCCCTA 780 

CTAATGACCA GTGATCTAAG GACACCAGAT TCCCTCTCAC CTCCTCCCTG CCCATCTCAG 840 

GGCCCGCTGA GTCCTTTTGC CCTCCCAGCT CCCTGCTACC CCTTCCTGTG TGCTGTTCTC 900 

TGATCCATTT CTAGGGTGTC CTCTGCCCTC ATCCCCTGTC CCCGCCACCG AAGTCCCTCC 960 

TGCACCCCTT ATGGGCCTTT CCTACAAGCA GCCTTCACCC AGTGCTGCCC CTATGCCTCC 1020 

CCGTTCCCAA ATGTCCCTGA CTCTAACTTT CTGGTGCTGC CTTTTATCCG GGGGGGTCTT 1080 

CCCTCCATCC CACTCCCCTC CAGACCCCCA AGGGGAACCC TGATGCTAAT GGCAGTTGGG 1140 

CCTTAGGCAG GGCGCAGGGC AGCGCAGATG CCCCCTCCCC TCCAGTGCAG ATGCCTGTTC 1200 

TGGACCCTGC CTCATTGTGG CCCCTTCCCC ACTCCTTCAT CCTCAGCCTC ACCCTCTTGA 1260 

GGACCCCACC CTCCAGCCCA CAGGTGCTGG ACCATCCCTC CCTGGTCCCT CCGCCCCTCT 1320 

CCACCTTGGG ACCTTGTGCT GCTCCTATCT CTTGCCCAGC TGCCTTGGGC CCTCAGCACG 1380 

TTCTCATCTT TCAGTGGGAA AGTGGGAGTG CTGGAGCATA TGACAGTGCT GAGCATCTTT 1440 

CCCAAGCCCC ACCCTCCCCC AGAGCACCCT CCCCTCCTGT CCTCACCCTA CCCCAAGTTC 1500 

TCCCACAGTC ACTCCTGCCC CATGCTCATG CCGCCCTCCA GTTCTTGCTC TGCCCATCTC 1560 

CCCTCCCCAA CCCAGACCTA AAACAGGCTG TTGGGCCAAC TGTTCCTTGA CCTTCCTTCT 1620 

TTTCTTTTGG TTCCTTGACC CCAGTGGGCT CTCACTCCCC ACACCGCATA TCTAAAATCT 1680 

GTTTTGCCTG CTCTTGGGGT GCCACTGCTC CCCCTCCAGC ATTACTCCTT TTGGCAGGTC 1740 

CTTCCTCAGG CTGAGAATCT CCCCCTCTAC CTTGGTTTTC TCTCTCTGGC CAGCACCCCC 1800 

ACTCCTTGCT TTGTTTTTAA TTTTTAACTT TTGTTTGGGT ACGTAGTAGA TATATATGTA 1860 

TATATTTATG GGGTACATGG GATATTTTGA CACAGGCCTA CAATATGTAA TAATCACATC 1920 

AGGGTAAATG GGTTATATCA CAACAAGCAT TTATCCTTTC TTTGTGCTAC AAACAATCCC 1980 
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ATTATGCTCT TTCAGTTATT TTTAAATGTA 
GCTGTGCTAT CTACTAGATC TTATTCATTC 
TCCCTGCTCC CCXaCTCCCC ACTACCCTTC 
TCTCCCCATG AGGTCCATTG TTTTAAATTT 
AGTTTGTCTG TCTGGGCCTG GGGCTTATTT 
AAATGACACG ATGGCTGAAT AGTTCTCCAC 
ATGCGTCTGT TGATGGACAC TTAGATTGCT 
CAATAAACAT GGAAAAGTAG ATAGCTCTTT 
TGCCTAACAG TGGGAGTGCT GGAGCATATG 
GAACCTCCAC ATTGTTTCCC ATAGTGGTTG 
ATCCTCACCA GCATTCCTTA TTTCTACATC 
GGATAAAAGC CAGTTTATCT GGGGTGGGAT 
ATCTGTTGAC GAATGATGTT GAGCACCTTT 
CTTTTGAGAA ATGACTATTC AGATCTTTTC 
TTCCTATAGT TGTTCGAGCT CCTTATATGT 
GTTTGAAAAT ATTTTCTCCC ATTCTTGGAT 
GCTGTGCAGA AGCCTTTTTA CTTGATATGA 
TGTGCTTGTG GGGTATTACT TTAAAAATCT 
CCAATGTTTT CTTGTATAGT TTCATAGTTT 
TTGATTTGAT TTTTGTATAT GGTGAAAGAC 
TATCTAGTTT CCCCAGCACC ATTTTTGAAG 
TACCTTTGTT GGAAATGAGT TTACTGTAGA 
TGTTTCATTG GTCTGTGTGT CTGTTTTTAT 
CTCTGTAGTA TAATTTGAAG TCAGATAATG 
GGATAGCTTT ATCTATTCTG GTTTTTTTGT 
TTATTTCTGT GAAGAATGTC ATTAGTGTTT 
CTTTGGGTAG TATGGATATT TCAACAAAAC 
CTTTTCCATT TTTTGTGTCC TTCAATTTTT 
AGATGGAGTT TCACTCTTGT TGCCCAGGCT 



CAATAAATTA TTGTTGACTG TACTCACCCT 
TAATTATATT TTTGTACCCA TTATTAACCA 
TCAGCCTCTG GTAATCATCA TTCTATTGTC 
TGGCTGCCAC AAATAAGTGA GAACATGCAA 
CACTTCACAG GATGACCTCC AGTTCTTTGC 
ATACACATGT ACACCACATT TTCTTTATCC 
TGCAGATCTT GGCTACTTTG _ _ AATAGTGCTG 
AATATACCGA TTTCCTTTCT TTGGAGTATA 
ACAGCTCTAT TGTATTTTTA GTTTTTGGAA 
TACTAGTTTA CGTTCCCACC AACAGTGTAC 
CTCGCCAGCA TTCCTTATTG CCTGTCTTCT 
GTTATCTCGT AGGAGTTTTG ATTTGCCTTC 
TCATATACCT GTTTGCCATT TATATGTCTT 
TCATTTTTAA ATTGGATTAT TATATTTTTT 
TTCAGTTACT GATCCTTTGT CAGATGAATA 
GGTCTCTTCA TTTTGTTTAT TGTTTCCTTT 
TCCCATTTAT GCAATTTTAC TTTGGTTACC 
TTGCCCAGTC CAATATCCTA GAGAGTTTCC 
GAGGTCATAG ATTTACATCT TTAATCCACT 
AGGGTCTAGT TTCATTCTTC TGCATAAGGA 
AGACTCTCCT TTGCCAATGT GTGTTCTTGG 
TGTATGGAAT TGTTTCTGGG TTCTCTATTC 
GCCAGTATCA TGCTGTTTTG GTTACTGTAG 
TGATTCCTCT AGTTTTGTTC ATTTTGCTCA 
GGTTCCATAT GCATTTTAGG ATTATTTTTA 
TGATAGGGAT TGCATTGAAT CTGTAGATTA 
TGATTCTTCC AATCCATGAA CGTGGACTAT 
TGCATCAGTG TTTTTTGTTT TTGGTTTTTG 
AGAATGCAAG GGTGTGATCT TGGCTCACCG 
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CAACCTCCGC CTCCCAGGTT CAAGCTATTC 
TACAGGCATG TGCCACTGTG CCTGGCTAAT 
TATGTTGGCC AGGCTAGTCT TGAACTCCTG 
CAAAGTGCTG GGATTACAGG CATGAGCCAC 
TTTATTGGAG AGGTCTTTCA CTTCTTCAGT 
TGTAGCTATT GTAAATGGGA TTCGTTTCTT 
ACTGATTTTT GCATGTTGAT- TTTGTATCCT- 
AATGGTTTTT TGGTGGAGTC TTTAGGTTTT 
AAGGATAATT TGACTTCTTC TTTTCCAGTT 
ATTGCTCTAG TTAGGATCTG CAGTACTGTG 
TGTCTTATTC CAGATCTTAG AGAAAAGGCT 
GCTGTGAGTT TGTCATATAT GGCTTTTATT 
TTTTTGAGAG TTTTTATCAT GAAGGGATGT 
TTGAATGATA CTGGCTTTTG TCCTTTATTC 
TGTGTATGTT AAATCATCCT TGCATACCTG 
TCTTTTTTAA TGTATTGTTG AATGTGGTTT 
GTGTTCATCA GGGATATAGG CCTGTAGTTT 
TTGTATCAGG ATATTCCTGG CTTTGTAAAA 
TTTTCAGAAC AGTTTGAATA GGACTGACAT 
AATTATACAT TACATAAATT TTACTGTTTT 
TAGATACATT CACATTTTTG TGCAACCCAA 
ATTCCTCCCT ACCTCTGGCC CCTGGTAACC 
ACCACTCTAG GTACCTCATT TAAGCAGAAT 
TATTTCACTT ATAATATTTT TGAGGTTCGG 
GCACTTTGGG AGGCTGAAGC AGGTGGATCA 
CCAACATGGT GAAAC CCCAT CTCTACTAAA 
GGTGCCTGTA ATCCCAACTA CTTGGGAGGC 
GTGGAGGTTG CAGTGAGCTG AGATCAGGCC 
AATTCCATCT CCAAAAAAAA AAAATAAAAC 



TTCTGCCTCA GCCTCCCAAG TAGCTGGGAT 
TTTCTATTTT TATTAGAGAT GGGGTTTCTC 
ACCTCAGGTG ATCCACCTGC CTCGGCCTCC 
CACGCCCAGC CACATCACTG TTTTATAGTT 
TAGGTTTATT CCTCAGTATT TTATTTTATT 
GATTTCTTTT TCAGATTATT TGCTGTTAGC 
GCAACTTTAC TGAATTTGTT CTTCAGTTCT 
TCCAAATATC AGACCACATG ATCTGCAAAC 
TTAATGCCCT TTCTTTCTTT CTCCTGTCTG 
TTGCATAACT GTGGTAAAAT TAGTCATCCT 
TTCAGTTTTC CCCCATTCAG TATGTTACTA 
ATATTGAGGT CTGTTCCTTG TATACTTAGT 
TGAATTTATC AAATGCTTTT TCAGTATCAA 
TGTTGATATG ACGTATTACA TTGATTGATT 
GAATACATTC CACTTGCTCA TAAAGAATGA 
GCTAGTATTT CCTTGACGAT TTTTGCATCG 
TCTTTTTTAT GATGTGTCTT TGCCTGGTTT 
TGAGTTTGGA AGTATTCCCT CCTCCTCTAT 
ATGTTGTTCT TTAAAAGTTT AATTGTGGTA 
AACCACTTTT AAGTGTATAC TCGGTGGCAT 
AACTCTGTGC CCATTAATCG GTAACTCCCC 
ACCATTCTAC TTTTTGTTTC TATGAATTTG 
CATGTAATGT TTGTCTTTTT GTTTCTGGCT 
TGGGCACAGT GGCTCACGCC TGGATTTCCA 
CCTGAGTTTC GGAGTTCGAA ACCAGCCTGG 
AATAATAAAA GTTAGCCGGG CGTGATGGCG 
TGAGGCAGGA GAATCGCTTG AATCCGGGAA 
ACTGCACTCC AGCCTGGGCA ACAAGAGTGA 
AATAATAATA ATAATATTTT TGAGGTTCAT 
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CCAAGTTGTA GTATGGGTCA GAATTTCATT 
GTATGTACCA CATCTTGGTT ATCCATCCCT 
TTTTGGATAT TGGCAAATAT TTCATTTCCT 
TCTTTTGGGT ATATATCCAG AAATAGAAGC 
TTTGCCAACC TTGCTCTGTG TGTGTGTGTA 
TCCTGATTGG TTTCAGGTGG CATCTCATTG 
CTGATATTGA GCATCTTTTC ATGTGTTTGT 
GCCATTTAAG TCTTTTGCCC ATTTTTTCCC 
TTGCAATATT TATTTCATTT CGGGGTTGAT 
GCATAGATGT TTTGAATTTT CATCAGTCTA 
TTTGGTGTCA TATCCATGAA AGCACTGTCA 
TTTGCTTCTA AGAAATTTTT AGGTTTTAGT 
ATTTTGAGTT AATTTTTGTA TATAGTGCAA 
CTGCCCCCAG AACTATTTGC TGAAAAGATC 
TGGACACTAG CTGTTCCATC CAATTGCTGT 
TTGAACCCAA GCCCACACCG TTCGTTGCTC 
TCTCTGGGGC TACAACCTTC CTACCCTTTG 
GCCATGCCCA TGATGAGTCT CTTCTCGAGG 
CCTATGCACC ACAGCCACAC TGGACATGGG 
CACCATCTCC TCTGGCAAGA CCTTCCTTCC 
AGGGCAGCCT CTCCCCTTGG CCCCTATTCC 
CACCTGGCCT ACAAGTTTGC CATCTTCATT 
CTATCCTCCC ACCCTCACAG TTTTCTTTGT 
CGTGTGCATT TCCTGCCCCA GGAAGGTTGG 
TGAAGTTACT TCCAGGAAGA AGTTTCACCT 
CTTCGTTATA ACCTCACGGT GCTGTCCTGG 
GAGGTACATC TGGATGGTCA GCCCTTCCTG 
CCCCAGGGAC AGTGGGCAGA AGATGTCCTG 
GACTTGACAG GGAACGGAAA GGACCTCAGG 



CCTTTTAAGG ATGGATAATA CTCATTATAT 
CAGACAATGG ACACTTGGGT TACTTCTACC 
TTGGGTATAT ATTTATTTCC TTTGGGTATT 
AGTACACAGG GGCTTCATTT TCTCTGTCTC 
TGTGTGTGTG TAGGTGTGTG ATAACAGCCA 
TGGTTTGGAT TTGCATTTTC CTAATGAGTG 
TGATCATTTG^ TAATTTTCTT _XGAAGAA3TG 
CCACATAGCT TCTCTTATCA GATATATGAC 
TGCTTTTTCA CTCTGATTGT GCCCTTTGAT 
CTTTGTCAGT TCTTTCTATT CTATCTGTGC 
AATCCTATGT CATGAACATT ATCCCCAATG 
TCTTGAGTGT AGAGTTTAGG TCTTTGATTC 
ATTAAGGGTC CAATTTTATT TTAACACCCC 
AACTGACTCT TTGTCACCTG CTCACCCCAG 
CCTGGGGCCT TGTCATGCTA CTCTTCCACT 
CCCTCTGGGA TACTGACCCC ACTATAAACT 
TGCCTCATGA CCACCCCCTC CCTTGTCCCC 
CAGCTCCCCT TGCCTCCATC TCACCCTCAG 
TCCCTCTGAG CCTGAGTCCC TTCCCATTCC 
ACCACCTTCA TGCTCCTCCC TTGCCCCTGC 
CTTAGGGGGC TTGTGGCCAC CCAGTCCTTG 
CCCCCTTCTT CTGTTCATCA GCCCCCTCCT 
ATATGAAATC CTCGTTCTTG TCCCTTTGCC 
GACAGCAGAC CTGTGTGTTA AACATCAATG 
GTGATTTCCT CTTCCCCAGA GCCCCACAGT 
GATGGATCTG TGCAGTCAGG GTTTCTTGCT 
CGCTATGACA GGCAGAAATG CAGGGCAAAG 
GGAAATAAGA CATGGGACAG AGAGACCAGG 
ATGACCCTGG CTCATATCAA GGACCAGAAA 
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GAAGGTGAGA GTCGGCAGGG GCAAGAGTGA 
GCAGAGAGCA GGGACCTGTC TCTTCCCACT 
GGGGTCAGTG GAACTCAGCA GGGAGGTGAG 
GGGAGGGCCA GGGAGGCGTA CCCCCTGGGC 
GGTTCGGGAA TGGAGAAGTC ACTGCTGGGT 
TAGGGTCTGT GAGATCCATG AAGACAACAG 
TGGGGAGCTC TTCCTCTCCC AAAACGTGGA 
CAGAGCTCAG ACCTTGGCCA TGAACGTCAG 
CAAGACACAC TATCACGCTA TGCATGCAGA 
ATCCAGCGTA GTCCTGAGGA GAAGAGGTAC 
CAATTCTGCT AGAGTTGCCT CACCTCCAAG 
GGATGAAGGC ATTTCCTGTT GGCACATCGT 
TGGATAAAGA CAGTGGGTCA GGGACTGGAC 
GACCCTCCGA CAGAATCCTG AGCCTGTGGT 
GCCAGGGCTG CCCCCTCTGC CTCCCAGCCT 
ACAACCCAGG AGTCCACCCC TGACATCCCC 
CCTGAGGCCA CAGTCCCAAG GCCCATCCTC 
TGAGGACAGA CTTGCAGGTC AGGGGTCCCG 
GAGAAACAGC CCTGTTCCTC TCCCCTCCTT 
CTTTCTTCTC CAGTGCCCCC CATGGTGAAT 
ATCACCGTGA CATGCAGGGC TTCCAGCTTC 
CAGGATGGGG TATCTTTGAG CCACGACACC 
AATGGAACCT ACCAGACCTG GGTGGCCACC 
ACCTGCTACA TGGAACACAG CGGGAATCAC 
AGGGTGACCC TGGAGAGGGT CAGGCCAGGG 
GCCCAGTGTA TAACAAGTCC CTTTTTTTCA 
CAGACATTCC ATGTTTCTGC TGTTGCTGCT 
TTCTATGTCC GTTGTTGTAA GAAGAAAACA 
GGGCAGTTTC TGGAGATGGT AAGGCCCCTG 



CTGGAGAGGC CTTTTCCAGA AAAGTTAGGG 
GGATCTGGCT CAGGCTGGGG GTGAGGAATG 
CCGGCACTCA GCCCACACAG GGAGGCATGG 
TGAGTTCCTC ACTTGGGTGG AAAGGTGATG 
GGGGGCAGGC TTGCATTCCC TCCAGGAGAT 
CACCAGGAGC TCCCAGCATT TCTACTACGA 
GACTGAGGAA. TGGACAGTGC .CCCAGTCCTC. 
GAATTTCTTG AAGGAAGATG CCATGAAGAC 
CTGCCTGCAG GAACTACGGC GATATCTAGA 
GGACGCTGGC CAGGGGCTCT CCTCTCCCTC 
ATGTGTCCAG GGAAACCCTC CCTGTGCTAT 
GTCCTGATTT TCCTCTATTG TTAGAGCCAC 
CATCCAGTGT TGTAATCAGG GCAAGTAGAG 
GGGTGTCAGG CAGGAGAGGA AGCCTTCAGG 
GCCCATCCTG GAGAGTTCCC TCCTGGCCCC 
CTCCTCAGCA TCAATGTGGG GATCCCAGAG 
CTGCCAGCCT GGAAGAACTG GGCCCCAGAG 
GAGGGCTTCA GCCAGAGTGA GAACAGTGAA 
AGAGGGGAGC AGGGCTTCAC TGGCTCTGCC 
GTCACCCGCA GCGAGGCCTC AGAGGGCAAC 
TATCCCCGGA ATATCACACT GACCTGGCGT 
CAGCAGTGGG GGGATGTCCT GCCTGATGGG 
AGGATTTGCC AAGGAGAGGA GCAGAGGTTC 
AGCACTCACC CTGTGCCCTC TGGTGAGCCT 
TAGGGACAGC AGGGATGGCT GTGGCTCTCT 
GGGAAAGTGC TGGTGCTTCA GAGTCATTGG 
GCTGCTGCTG CTATTTTTGT TATTATTATT 
TCAGCTGCAG AGGGTCCAGG TGAGAAAAGC 
TCTGGGCAGT AGGGTCCCCT CATTGCTCCT 
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GCAAAGATAG GCATGTTGGT GACAAGGCTT CTGTAACAGG GGATGAAAGT TGGGGAATTT 9000 

GGGAAGGGAA TGGGGGCAGC ATCTCCATCT ACACCCATAA GTGCTGCCCA AGCGAGGGTC 9060 

AAACGCCCAG CTGTGGCATC TTCCTGCTGC AGGTGAGGAG TGGGCAGCAG GGAGGGCTGC 9120 

GGCGCCTGCT CTGTCCCCAT CCCGGTCTCT GTGTCTCTTG GACTCACTAG GGCGCATCCA 9180 

GGTGGGGTGA GCTGGGAATC ACGTGCTGAA TGCTGAGGGC CTGGATGATC ACGGCCTCAG 9240 

AGGGAGCAAA TAGTAAAGGC AGCTGTGATC TGGGGAGGGC CAGAAACTGG AGAGGAATCT 9300 

GAGGAGAGGC GGTGCCCCTA. .TPCCCTTCCT CTOTGCATCC„CCCTCCCCTG„TTTCTCCAGC. 5360.. 

CATCGGGGCG GACACCGAGA AAAAGACCTA TGAGGCCCAG CCTGGGGGCC CTGCCTGTGT 9420 

AGCCCTTTGG AGACCCCTAG TAACAGGGAG GGTCCTGAGC ACACATGGCC ATCTCTGTCC 9480 

ACTGTGCAGC TCCCCATGCA CCTCCTCCAG GAGCTTTCTT GGGGTTGTCG TGTCCTCTGC 9540 

ACCATTCGAG GCCCTACTCT TTCCAGGTTC CCACGGCCTG GCCTCCCTGA GTTTCTTGCA 9600 

GATGACATGG ATGAGTAGAT AAGCAGATGT CCCTGGGCCA TTTGAGGAGT GGGGCCCAGC 9660 

CCCTCATCAG GGCAGCTGTG GTCCCTGTTT TCATCCTACC TCCGAGTGTT TTCTTCTCCA 9720 

GTCCCTGAGG GACACAGTCC TCAGGGCCCA TGTTTTTGGG GATTTAATCT GTGCTCTGTG 9780 

GCCTCACCTT GCCTTCCCTG AGCCAATTTC CCTTTCTAAA GGTGGTCACT GCCTGGTAAG 9840 

TTTGGAGTAA GGGACGGTCA GAATCATTTC CCCTACAGTC AGGTTGTTTG ATGGGGGATG 9900 

AAAAGAGACA GCAGGAAGTT TTGTGTTTCT GCAAAGACAG AAGCAGTTCA GGCGACAGTA 9960 

AGAGGCTGGG GTGTCCAGGA GGGTGTGTCT GGCAGTAGGG TCGCTGGTTT CTCATCCTTG 10020 

AACCTAATTG CACTGTCAGT CGGCCCCTCA GGCCTGAGCA GATGGGAAGG TTTGTCCCCT 10080 

GCCCTGCAGC AAGAGGGCCC TGTCCAGGAG GCACCCACAA CAGAGGCAGT GCAGGTCTGT 10140 

GGTCACTCCT ACTCTCACCT GTGGCGTCTC CCGTAGAGGG ATTGTCAGTT CTGGTTCCCT 10200 

GTGGGCAGGA ATGGTTTCCT CATAGGTCAC TGGAGTTTTG GCCAGGAAAA GAGTATGAAG 10260 

TTCATGTGGC AGTTTCTCAA AATTCCTGCT TTCAATGTTG ATGTCCAGTA AAGATATTCG 10320 

TAATTTCAGC TCTATAATCT TAATAGGATT TCCTCTAATA TTGTGAAGCA TATTATATGA 10380 

AACAGGAACA CAAATTTCTC AAAATTCCTG CGATGTCCAA TAAAGATTTT CATAATTTCA 10440 

GCTCTGCAAT CTTAATAGGA TTTCCTAATA CTGTAAAGCA TATTAAATGA AACAGGAACT 10500 

CAAATTTGGA GCCCCCTCTC CAGGAGGTTC TGTGTGGAGA TGGTGGCTGT GGCAGTGGCA 10560 

GTTCCCAGGT GCAGAGGGTG GGCAGAGGCA GCCTCAGGCT AAGGGGTCTC CCCTACTCCA 10620 

CATGGAGAAA ATCCCTTGTA GGTTGCAAGG GCAGTGGCCG GGTGGAATCC CTGCTAGGGA 10680 
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CAGAGCAGGA AGGCCTCGCA GCCTCACCAA GCAGCAGCCC TGGGGTGGAG CTGCGTTTCC 10740 

AGGGTTAAGC GGACCAGGCA GGAGTAGCGG TTACTCAAGA GCAGGTCACA GGCTTGGGTT 10800 

GTGAGGGTCA GGAGAGGCCA GGCCTCCTCG AGCAAGGTGG GGGTCCCAGG GTCAGGTCAG 10860 

GTGCAGATCC TGTGGCAGCC ACGTCTTTCC ATGCTGGGCC TGCTGGGCCC CCCAGGCTTC 10920 

CTGATGGGGT CCCCAGTTAG GAGCTGCCTG CTCAGGGCTG GGAGGGGAGG AGCACTGAGC 10980 

TGCAGATAGA GGGCAGAGCC CACAGTGGGC AGGGCCTGCC CTGGTGTGTA GGTGCCTCTG 11040 

CAGGAGAGGA GGGCCTGGGa . ACTGAGAGCA. AGGGTCAGGG_ CCTCTCTTTG- GGGAGGCCTC 11100 

TCACTGTAAC AGGACTGGTC AGGCCTGAGA GGAGGGCACT GGGTTCCCTC TTGGGTCTTG 11160 

TCCTTTAGTC TTGGGGCCCT TTCCCTCCCT GCACGATGAG TGGTGGGCAC AGGGCACGGG 11220 

CTGATGTTGA TGGAGTGATG GGAGGGAACT GGCAGGGGCT GGGAAAAGCA AGGAGGGAGG 11280 

AAGAAAAAAG TGGGGGCCTC ATCTTCCCTC AGAGAAAGGG CAAATCTGGT TTTGGAGCAA 11340 

CTGAAGAGAG AAAAGTCCCC AGGGAATAAA CACAACACTG CACCCAGTGG AGCATTTACC 11400 

CATTTCCCTC TTTTCTCCAG AGCTCGTGAG CCTGCAGGTC CTGGATCAAC ACCCAGTTGG 11460 

GACGAGTGAC CACAGGGATG CCACACAGCT CGGATTTCAG CCTCTGATGT CAGCTCTTGG 11520 

GTCCACTGGG TCCACTGAGG GCGCCTAGAC TCTACAGCCA GGCGGCTGGA ATTGAATTCC 11580 

CTGCCTGGAT CTCACAAGCA CTTTCCCTCT TGGTGCCTCA GTTTCCTGAC CTATGAAACA 11640 

GAGAAAATAA AAGCACTTAT TTATTGTTGT TGGAGGCTGC AAAATGTTAG TAGATATGAG 11700 

GCATTTGCAG CTGTGCCATA TT 11722 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 385 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Leu Gly Pro Val Phe Leu Leu Leu Ala Gly lie Phe Pro Phe 
15 10 15 

Ala Pro Pro Gly Ala Ala Ala Glu Pro His Ser Leu Arg Tyr Asn Leu 
20 25 30 

Thr Val Leu Ser Trp Asp Gly Ser Val Gin Ser Gly Phe Leu Ala Glu 
35 40 45 
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Val His Leu Asp Gly Gin Pro Phe Leu Arg Tyr Asp Arg Gin Lys Cys 
50 55 60 

Arg Ala Lys Pro Gin Gly Gin Trp Ala Glu Asp Val Leu Gly Asn Lys 
65 70 75 80 

Thr Trp Asp Arg Glu Thr Arg Asp Leu Thr Gly Asn Gly Lys Asp Leu 
85 90 95 

Arg Met Thr Leu Ala His lie Lys Asp Gin Lys Glu Gly Leu His Ser 
100 105 110 

Leu Gin Glu lie Arg Val Cys Glu lie His Glu Asp Asn Ser Thr Arg 
115 120 125 

Ser Ser Gin His Phe Tyr Tyr Asp Gly Glu Leu Phe Leu Ser Gin Asn 
130 135 140 

Val Glu Thr Glu Glu Trp Thr Val Pro Gin Ser Ser Arg Ala Gin Thr 
145 150 155 160 

Leu Ala Met Asn Val Arg Asn Phe Leu Lys Glu Asp Ala Met Lys Thr 
165 170 175 

Lys Thr His Tyr His Ala Met His Ala Asp Cys Leu Gin Glu Leu Arg 
180 185 190 

Arg Tyr Leu Glu Ser Ser Val Val Leu Arg Arg Arg Val Pro Pro Met 
195 200 205 

Val Asn Val Thr Arg Ser Glu Ala Ser Glu Gly Asn lie Thr Val Thr 
210 215 220 

Cys Arg Ala Ser Ser Phe Tyr Pro Arg Asn lie Thr Leu Thr Trp Arg 
225 230 235 240 

Gin Asp Gly Val Ser Leu Ser His Asp Thr Gin Gin Trp Gly Asp Val 
245 250 255 

Leu Pro Asp Gly Asn Gly Thr Tyr Gin Thr Trp Val Ala Thr Arg lie 
260 " 265 270 

Cys Gin Gly Glu Glu Gin Arg Phe Thr Cys Tyr Met Glu His Ser Gly 
275 280 285 

Asn His Ser Thr His Pro Val Pro Ser Gly Lys Val Leu Val Leu Gin 
290 295 300 

Ser His Trp Gin Thr Phe His Val Ser Ala Val Ala Ala Ala Ala Ala 
305 310 315 320 

Ala lie Phe Val lie lie lie Phe Tyr Val Arg Cys Cys Lys Lys Lys 
325 330 335 



Thr Ser Ala Ala Glu Gly Pro Glu Leu Val Ser Leu Gin Val Leu Asp 
340 345 350 
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Gin His Pro Val Gly Thr Ser Asp His Arg Asp Ala Thr Gin Leu Gly 
355 360 365 

Phe Gin Pro Leu Met Ser Ala Leu Gly Ser Thr Gly Ser Thr Glu Gly 
370 375 380 

Ala 
385 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGCCATGGG GCTGGGCCGG GTCCTGCTGT TTCTGGCCGT CGCCTTCCCT TTTGCACCCC 
CGGCAGCCGC CGCTGAGCCC CACAGTCTTC GTTACAACCT CATGGTGCTG TCCCAGGATG 
AATCTGTGCA GTCAGGGTTT CTCGCTGAGG GACATCTGGA TGGTCAGCCC TTCCTGCGCT 
ATGACAGGCA GAAACGCAGG GCAAAGCCCC AGGGACAGTG GGCAGAAGAT GTCCTGGGAG 
CTAAGACCTG GGACACAGAG ACCGAGGACT TGACAGAGAA TGGGCAAGAC CTCAGGAGGA 
CCCTGACTCA TATCAAGGAC CAGAAAGGAG GCTTGCATTC CCTCCAGGAG ATTAGGGTCT 
GTGAGATCCA TGAAGACAGC AGCACCAGGG GCTCCCGGCA TTTCTACTAC GATGGGGAGC 
TCTTCCTCTC CCAAAACCTG GAGACTCAAG AATCGACAGT GCCCCAGTCC TCCAGAGCTC 
AGACCTTGGC TATGAACGTC ACAAATTTCT GGAAGGAAGA TGCCATGAAG ACCAAGACAC 
ACTATCGCGC TATGCAGGCA GACTGCCTGC AGAAACTACA GCGATATCTG AAATCCGGGG 
TGGCCATCAG GAGAACAGTG CCCCCCATGG TGAATGTCAC CTGCAGCGAG GTCTCAGAGG 
GCAACATCAC CGTGACATGC AGGGCTTCCA GCTTCTATCC CCGGAATATC ACACTGACCT 
GGCGTCAGGA TGGGGTATCT TTGAGCCACA ACACCCAGCA GTGGGGGGAT GTCCTGCCTG 
ATGGGAATGG AACCTACCAG ACCTGGGTGG CCACCAGGAT TCGCCAAGGA GAGGAGCAGA 
GGTTCACCTG CTACATGGAA CACAGCGGGA ATCACGGCAC TCACCCTGTG CCCTCTGGGA 
AGGTGCTGGT GCTTCAGAGT CAACGGACAG ACTTTCCATA TGTTTCTGCT GCTATGCCAT 
GTTTTGTTAT TATTATTATT CTCTGTGTCC CTTGTTGCAA GAAGAAAACA TCAGCGGCAG 
AGGGTCCAGA GCTTGTGAGC CTGCAGGTCC TGGATCAACA CCCAGTTGGG ACAGGAGACC 
ACAGGGATGC AGCACAGCTG GGATTTCAGC CTCTGATGTC AGCTACTGGG TCCACTGGTT 
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CCACTGAGGG CGCCTAGACT CTACAGCCAG GCGGCCAGGA TTCAACTCCC TGCCTGGATC 1200 

TCACCAGCAC TTTCCCTCTG TTTCCTGACC TATGAAACAG AAAATAACAT CACTTATTTA 1260 

TTGTTGTTGG ATGCTGCAAA GTGTTAGTAG GTATGAGGTG TTTGCTGCTC TGCCACGTAG 1320 

AGAGCCAGCA AAGGGATCAT GACCAACTCA ACATTCCATT GGAGGCTATA TGATCAAACA 1380 

GCAAATTGTT TATCATGAAT GCAGGATGTG GGCAAACTCA CGACTGCTCC TGCCAACAGA 1440 

AGGTTTGCTG AGGGCATTCA CTCCATGGTG CTCATTGGAG TTATCTACTG GGTCATCTAG 1500 

AGCCTATTGT TTGAGGAATG CAGTCTTACA AGCCTACTCT GGACCCAGCA GCTGACTCCT 1560 

TCTTCCACCC CTCTTCTTGC TATCTCCTAT ACCAATAAAT ACGAAGGGCT GTGGAAGATC 1620 

AGAGCCCTTG TTCACGAGAA GCAAGAAGCC CCCTGACCCC TTGTTCCAAA TATACTCTTT 1680 

TGTCTTTCTC TTTATTCCCA CGTTCGCCCT TTGTTCAGTC CAATACAGGG TTGTGGGGCC 1740 

CTTAACAGTG CCATATTAAT TGGTATCATT ATTTCTGTTG TTTTTGTTTT TGTTTTTGTT 1800 

TTTGTTTTTG AGACAGAGTC TCACTCGTCA CCCAGGCTGC AGTTCACTGG TGTGATCTCA 1860 

GCTCACTGCA ACCTCTGCCT CC CAGGTTCA AGCACTTCTC GTACCTCAGA CTCCCGATAG 1920 

CTGGGATTAC AGACAGGCAC CACCACACCC AGCTAATTTT TGTATTTTTT GTAGAGACGG 1980 

GGTTTCGCCA AGTTGACCAG CCCAGTTTCA AACTCCTGAC CTCAGGTGAT CTGCCTGCCT 2040 

TGGCATCCCA AAGTGCTGGG ATTACAAGAA TGAGCCACCG TGCCTGGCCT ATTTTATTAT 2100 

ATTGTAATAT ATTTTATTAT ATTAGCCACC ATGCCTGTCC TATTTTCTTA TGTTTTAATA 2160 

TATTTTAATA TATTACATGT GCAGTAATTA GATTATCATG GGTGAACTTT ATGAGTGAGT 2220 

ATCTTGGTGA TGACTCCTCC TGACCAGCCC AGGACCAGCT TTCTTGTCAC CTTGAGGTCC 2280 

CCTCGCCCCG TCACACCGTT ATCGATTACT CTGTGTCTAC TATTATGTGT GCATAATTTA 2340 

TACCGTAAAT GTTTACTCTT TAAATAAAAA AAAAAAAAAA 2380 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Leu Gly Arg Val Leu Leu Phe Leu Ala Val Ala Phe Pro Phe 
15 10 15 

Ala Pro Pro Ala Ala Ala Ala Glu Pro His Ser Leu Arg Tyr Asn Leu 
20 25 30 
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Met Val Leu Ser Gin Asp Glu Ser Val Gin Ser Gly Phe Leu Ala Glu 
35 40 45 

Gly His Leu Asp Gly Gin Pro Phe Leu Arg Tyr Asp Arg Gin Lys Arg 
50 55 60 

Arg Ala Lys Pro Gin Gly Gin Trp Ala Glu Asp Val Leu Gly Ala Lys 
65 70 75 80 

Thr Trp Asp Thr Glu Thr Glu Asp Leu Thr Glu Asn Gly Gin Asp Leu 
85 90 95 

Arg Arg Thr Leu Thr His lie Lys Asp Gin Lys Gly Gly Leu His Ser 
100 105 110 

Leu Gin Glu lie Arg Val Cys Glu lie His Glu Asp Ser Ser Thr Arg 
115 120 125 

Gly Ser Arg His Phe Tyr Tyr Asp Gly Glu Leu Phe Leu Ser Gin Asn 
130 135 140 

Leu Glu Thr Gin Glu Ser Thr Val Pro Gin Ser Ser Arg Ala Gin Thr 
145 150 155 160 

Leu Ala Met Asn Val Thr Asn Phe Trp Lys Glu Asp Ala Met Lys Thr 
165 170 175 

Lys Thr His Tyr Arg Ala Met Gin Ala Asp Cys Leu Gin Lys Leu Gin 
180 185 190 

Arg Tyr Leu Lys Ser Gly Val Ala lie Arg Arg Thr Val Pro Pro Met 
195 200 205 

Val Asn Val Thr Cys Ser Glu Val Ser Glu Gly Asn lie Thr Val Thr 
210 215 220 

Cys Arg Ala Ser Ser Phe Tyr Pro Arg Asn lie Thr Leu Thr Trp Arg 
225 230 235 240 

Gin Asp Gly Val Ser Leu Ser His Asn Thr Gin Gin Trp Gly Asp Val 
245 250 255 

Leu Pro Asp Gly Asn Gly Thr Tyr Gin Thr Trp Val Ala Thr Arg He 
260 265 270 

Arg Gin Gly Glu Glu Gin Arg Phe Thr Cys Tyr Met Glu His Ser Gly 
275 280 285 

Asn His Gly Thr His Pro Val Pro Ser Gly Lys Val Leu Val Leu Gin 
290 295 300 

Ser Gin Arg Thr Asp Phe Pro Tyr Val Ser Ala Ala Met Pro Cys Phe 
305 310 315 320 



Val He He He He Leu Cys Val Pro Cys Cys Lys Lys Lys Thr Ser 
325 330 335 
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Ala Ala Glu Gly Pro Glu Leu Val Ser Leu Gin Val Leu Asp Gin His 
340 345 350 

Pro Val Gly Thr Gly Asp His Arg Asp Ala Ala Gin Leu Gly Phe Gin 
355 360 365 

Pro Leu Met Ser Ala Thr Gly Ser Thr Gly Ser Thr Glu Gly Ala 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ACTGGGGAAC AAGGTTTATA TGAGA 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TGTCACCCGT CTTCTACAGG ACCC 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 
GGGGCCATGG GGCTGGG 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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ATCTGA6ATG TCGGTCC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGTTCTTGTC CCTTTGCCCG TGTGC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AACCCTTCCC TTACCCCCGT CGTAG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TATGTAAAAC GACGGCCAGT TTCACCTGTG ATTTCCTCTT CCCCA 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTCTTTTCA ATCCCCGTCT CTCGTCCAGT ATCGACAAAG GACAT 



(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TATGTAAAAC GACGGCCAGT TTCGGGAATG GAGAAGTCAC 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGAGAGGAGA GGGAGGTTAA CCAGTATCGA CAAAGGACAT 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TATGTAAAAC GACGGCCAGT GTTCCTCTCC CCTCCTTAGA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AAAAAGTCCC TTTCACGACC ACCAGTATCG ACAAAGGACA T 



